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To adapt their behaviour in changing environments, cells sense concentrations by binding external 
ligands to their receptors. However, incorrect ligands may bind nonspecifically to receptors, and 
when their concentration is large, this binding activity may interfere with the sensing of the ligand 
of interest. Here, I derive analytically the physical limit to the accuracy of concentration sensing 
amid a large number of interfering ligands. A scaling transition is found when the mean bound 
time of correct ligands is twice that of incorrect ligands. I discuss how the physical bound can be 
approached by a cascade of receptor states generalizing kinetic proof-reading schemes. 


Because of their small sizes, biological systems typ¬ 
ically operate with only a few copies of the molecules 
they sense and communicate with. In their pioneering 
work, Berg and Purcell derived the fundamental bound 
that the noise arising from these small numbers sets on 
the accuracy of concentration sensing [T]. Experimental 
progress in the characterization of single-cell variability 
[2] and sensing precision [3] has fueled a renewed inter¬ 
est in small-number noise and its implications for infor¬ 
mation processing [IHS]- General or refined bounds on 
sensing accuracy have been recently derived for single 
receptors [3i], and extended to spatial [lUHUi or tem¬ 
poral m gradient sensing, while the metabolic cost and 
trade-offs of sensing accuracy have been explored 
[35] . Much of this past work has assumed perfect speci¬ 
ficity between the biological receptors and their cognate 
ligands. In realistic biological contexts, large numbers of 
spurious ligands may bind receptors nonspecifically, in¬ 
terfering with the ligand of interest [35] . This is the case 
in the problem of antigen recognition by T-cell receptors, 
where cells must react to a small number of specific for¬ 
eign peptides among a large number of nonspecific self¬ 
peptides m- Biochemical network architectures based 
on kinetic proofreading [281 ESI have been shown to pro¬ 
vide a solution to the discrimination problem, and have 
been studied in depth theoretically [351 13011321 . How¬ 
ever, no fundamental bound has been derived against 
which to compare the performance of these solutions, 
save for Ref. [33] where concepts of statistical decision 
theory were used to derive the minimal decision time to 
detect cognate ligands. In this paper I derive the fun¬ 
damental limit on concentration sensing accuracy and 
ligand detection error in the presence of a large number 
of spurious ligands. The maximum likelihood estimate 
achieving the bound can be implemented biologically by 
simple networks based on push-pull reactions. 

Consider a mixture of two ligands, only one of which 
the biological system wishes to sense. The ligand of inter¬ 
est (hereafter referred to as the correct ligand) is present 
in concentration c, while the interfering or spurious lig¬ 
and (called the incorrect ligand) is present in concentra¬ 
tion d. The biological unit can sense ligands through 
N identical receptors, which can be bound by either 


• correct ligand 

• incorrect ligand 
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FIG. 1: Reading concentrations off trajectories of re¬ 
ceptor occupancies. Cognate (correct) and spurious (incor¬ 
rect) ligands may bind N receptors, typically presented on the 
cell surface, with rate kc and kd. The incorrect ligands are 
in excess, d > c, but lead to shorter binding events, d > r. 
The information the cell can theoretically use is contained in 
the time traces of occupancy of all receptors (green curves). 
The maximum likelihood estimate fully exploits these traces 
to optimally infer the input concentrations c and d. 


ligand with a common rate k = ADa, where D is the 
molecule diffusivity and a the effective receptor size. Re¬ 
ceptors can distinguish between the two molecules thanks 
to their higher affinity to the correct ligand. Physically, 
this means that the unbinding rate r of the correct ligand 
is smaller than that of the incorrect ligand d > r. 

The occupancy of each receptor, 

kcr~^ + kdd~^ 

^ Id- kcr~^ + kdr'~^ ’ 

depends on both concentrations, and cannot be used 
alone to determine c. The interchangeability of the ratios 
c/r and d jd in this expression emphasizes the ambiguity 
between many incorrect ligands and a few correct ones. 
To discern these two effects, one must use the full tempo¬ 
ral record of occupancy of each receptor. The probability 
distribution for the binding and unbinding events at all 
receptors during a time interval T reads: 
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P = Y[ (kcre-^^^ + , (2) 
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with Ctot = c + c' is the total concentration of ligands, 
is the total unbound time accrued over all receptors, and 
ti,... ,tn the durations of the n binding events occurring 
at all N receptors during T. The log-likelihod C = In P 
can be rewritten as a sum of three independent contribu¬ 
tions, C — Co + Cl + £ 2 , where Cq depends on neither c 
or c', and where Ci and £2 pertain to the unbound and 
bound intervals respectively: 

£l(Ctot) ~ ^ fn Ctot ^Ctot^u (3) 

n 

£2(2:) = ^ In — a;-I-, ( 4 ) 

i=l 

where x = c/ctot is the fraction of correct ligands, and 
a = r/r' < 1 is the binding constant ratio. As can be 
seen in the respective dependencies of £1 and £2 upon 
Ctot and X, unbound intervals are informative of the total 
concentration, while the bound intervals are informative 
of the fractions of ligands. 

The maximum likelihood estimate for the total con¬ 
centration is obtained by the condition dCi/dctot = 0, 
which gives cJot = kTu/n. The error made by this esti¬ 
mate is given in the large time limit by the Cramer-Rao 
bound, which sets the best possible performance of any 
estimator |34) : 






Ctot 


n ADa{l-p)NT' 


(5) 


with <5ctot = Ctot ~ This result is that obtained in 
[5] for a single ligand, where the maximum-likelihood er¬ 
ror was shown to be half as small as the classical Berg 
and Purcell bound [1] based on the average receptor oc¬ 
cupancy. The reason for this difference is that the maxi¬ 
mum likelihood estimate is not affected by the noise due 
to the stochastic nature of receptor unbinding, as evident 
in Eq. In the case of a mixture, the receptor occu¬ 
pancy Eq. Q depends on x as well as Ctot, and does not 
even suffice to determine the total concentration. 

The fraction x of correct ligands can be estimated by 
maximum likelihood as well, by solving: 


dC2 

dx 


" c,eii-o‘yu _ 1 


( 6 ) 


The error can be estimated from the Cramer-Rao bound 
(App. A.l): 


{5x^) 


with f{x,a) ^ 
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The total error in the concentration of the correct lig¬ 
and c = xctot is then the sum of the (independent) 
errors in Ctot and x from Eqs. (§ and 0: (<5c2) « 
cLtix^ + f{x,a))/n. 



FIG. 2: Physical bound on concentration sensing error. 

The fundamental bound on the relative error in the fraction 
of correct ligands x = c/ctot scales with the inverse of the 
number of binding events n. Here the rescaled error fix, a) = 
n{6x'^) [Eq. 0], is represented as a function of the binding 
constant ratio a for various values of x. There are two distinct 
scaling regimes [Eq. @]- For Q > 1/2, the rescaled error 
depends weakly on x, while for a. < 1/2 it scales as x^, with 
P = 1 — a/(l — a), as illustrated in the inset for three example 
values of a. 


It is interesting to consider the limit where the correct 
ligands are rare, a; <C I, as in the case of immune recogni¬ 
tion. Two scaling regimes, illustrated in Fig.0 are found 
depending on the value of the ratio a between the two 
binding constants: 


fix, a) 


ff(a) a > 1/2, .O', 

hia)x^, p = 1- a < 1/2, 


with gia) = (2a — 1)/(1 — a)^ and ft.(a) = (1 — 
a)a“T^ sin(7ra/(l — a))/7r, and 0 < /3 < 1. Since 
x^ <C fix,a), the error in c reduces to: 


{5c^) « gia) 


{5c^) « /i(a) 


^tot 


4£)a(l -p)iVT 
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a > 1/2, 
a < 1/2. 


(9) 


In the hard discrimination regime (a > 1/2), incorrect 
ligands dominate the error, which is governed by Ctot 
as in Eq. 0. The prefactor gia) diverges at a = 1, 
as expected when the two ligands have the same bind¬ 
ing constant and are thus indistinguishable. By con¬ 
trast, in the easy discrimination regime (a < 1/2) the 
error is governed by a weighted geometric mean be¬ 
tween c and Ctot- In the limit of very small a, corre¬ 
sponding to clearly distinguishable ligands, the error is 
{5c^) « c/[4£)a(l — p)NT )]—precisely the error when no 
interfering ligand is present [8]. Eor x = 0, the maximum- 
likelihood estimate may infer a small x* = Sx > 0. How¬ 
ever, the second derivative of the likelihood diverges at 
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a; = 0 for a < 1/2, indicating that the Cramer-Rao 
bound Q fails to give a correct estimate of this error, 
which instead scales anomalously with the number of 
events: 6x ~ n““^, hence Sc ^ c“ot[4Ila(l — p)iVT]““^ 
(App. A.2). 

In many situations, it is more useful for the system 
to determine the presence of the correct ligand rather 
than its precise concentration, as in the recognition of 
foreign pathogens by immune receptors. This decision 
can be made optimally (in the Bayesian sense) by com¬ 
paring the likelihoods of the two competing hypotheses: 
presence versus absence of the correct ligand at fraction 
X. The presence of the correct ligand is detected when 
ln[P({ti}|a;)/P({ti}|0)] = £ 2 ( 2 :) > 9, where 9 is an ad¬ 
justable parameter controlling the balance between the 
false-positive and false-negative error rates FP and FN. 
These errors decay exponentially fast with large numbers 
n of binding events, and can be estimated in that limit 
using a saddle-point approximation (App. Bl): 


exp [n</(A) - X9] A 

Ay27rn|</"(A)| ’ ’ 


( 10 ) 


where (/(A) 


In 


due "^W — x + axe^^ a)«jA 


, and 


where A satisfies the saddle-point condition 9 = n(p'{X). 
The receiver operating characteristics (ROC) giving the 
dependency between FP and FN can thus be estimated 
parametrically by varying A. This saddle-point approxi¬ 
mation is well verified by numerical simulations (Fig.|^. 

As in the case of concentration sensing error, a scaling 
transition is found in the limit of scarce correct ligands, 
a: 1. When a > 1/2, one obtains 


p-\P-nx^ ! g{a) -\{\-Xfnx'^ j g{a) 

FP K. FN « 

\/2'ENnx'^!g[a) ’ — X)‘^nx‘^/g{a) ’ 

( 11 ) 

while when a < 1/2 both error rates decay as ^ 
(na:^)“^/^ exp[—with 7 = (1 — a)~^, 1 < 7 < 
2 and C a function of a and A (App. B.2). The 
time T necessary to make a reliable decision scales as 
[4iAa(l — p)A^]“^Ctotc“^ for a > 1/2, and as [4Zla(l — 
p)N\~^cl~^^c~^ for a < 1/2. Equivalent scaling laws 
were obtained in [33] for minimal on-the-fly detection 
times. 

Can biological systems approach the physical bound on 
concentration sensing given by Eq. Q? To gain insight 
into this question, one can expand Eq. ([^ at first order 
in X to get an approximation to the maximum likelihood 
estimate when a > 1/2 (for a < 1/2 this expansion gives 
quantities with diverging means and cannot be used): 


X 


* 


2 q ;-1 1 
(1 — a )2 n 


- 1) . 


( 12 ) 


This estimator, which is subject to the same asymptotic 
error as in Eq. ([^, suggests a simple strategy, where each 
receptor signals “positively” with a rate that depends on 



FIG. 3: Error in the detection of the correct ligand. 

Numerical and analytical estimate of the rate of false positive 
(FP) versus false negative (FN) errors in the detection of a 
small fraction x of correct ligands, for n = 10® and (a) a = 
0.3 and x = 0.003 and (b) a = 0.7 and x = 0.045. The 
black curve is the result of a numerical experiment, repeated 
5 • 10® times in presence of the correct ligand, and 5 • 10® 
without, and where a likelihood ratio test C 2 {x) > 6 was used 
with a varying threshold 6. The red curve is the analytical 
prediction from Eq. (lOl. The green to blue curves show the 
performance of optimized networks schematized in Fig.|^ for 
various numbers of receptor states m. 


how long it has been bound, «(!—Q;)r'e^^““^” and “neg¬ 
atively” {i.e. with an opposite effect on the readout, see 
below) through a fixed burst (a — l)i5(t) upon binding, so 
that the net effect of each binding event i on the readout 
molecule concentration is 


dt 


a(l — Q;)r'e^^ * -|-a — 1 = a)r q _ 

(13) 


i.e. exactly the argument of the sum in Eq. (12). 


This idea can be implemented biologically by a cas¬ 
cade of receptor conformational states triggered by bind¬ 
ing, and proceeding irreversibly from states 1 to m, each 
transition to the next state occurring with rate s (Fig.|^. 
The ligand is free to detach from the receptor at any 
time, bringing the receptor back to the unbound state 0 . 
The receptors signal through the production or activa¬ 
tion of two molecules B and D with opposite effects on a 
push-pull network governing the state of a molecule A, 
which provides the final readout for x through its modi¬ 
fied state X*. If one requires that the equilibration of B 
and D are fast, and that X and X* are always in excess 
in the Michaelis-Mentens reactions, then 


dX* 

dt 


N 


1=1 


Aq = const, (14) 


where p{j) is the state of the receptor, and 60 = 
do = 0. For the purpose of this discussion, the inter¬ 
nal molecules B, D and A are assumed to be unaffected 
by biochemical noise, restricting the source of noise to 
the input alone. In this design A* increases indefinitely 
to mimick the sum in Eq. (12) over all events at all re¬ 


ceptors. A more a realistic but equivalent scheme would 
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FIG. 4: Network for sensing the concentration of cor¬ 
rect ligands. Upon binding, each receptor enters a cascade 
of m states along which it proceeds with rate s. The lig¬ 
and can detach at any moment with rate r or r' depending 
on its identity (correct or incorrect), bringing the receptor 
back to the unbound state 0. While in conformational state 
^ = 1,..., m, the receptor activates two enzymes B and D 
with rates and each catalyzing two oppposite Michaelis- 
Mentens reactions of a pull-push network. B and D are as¬ 
sumed to equilibrate fast and to be always limiting in the 
reactions they catalyze, so that dX*/dt oc 6^ — d^. 


involve a running sum over an effective time T, obtained 
by relaxing X* to X with rate ^ T~^ |15) . 

When the number of states m is large and the tran¬ 
sitions between them are rapid, X* can track Eq. (12) 
with arbitrary precision when a > 1/2. In that case, the 
receptor state ^ provides an approximation to the time 
since binding, ^ « st. Then, for example, receptors sig¬ 
naling positively with rate 6^ oc a(l — « 

a(l — and negatively with rate oc (1 — 

a){s/no)e~^^^° « (1 — a)S{t) (with an adjustable pa¬ 
rameter) would exactly realize Eq. ( [T^ and thus the es¬ 
timator of Eq. (121 in the limit s/r ^ fj,Q ^ 1. 

Although such optimal performance is only reached for 
large m and a > 1/2, this network design may still per¬ 
form well in more general situations. One can optimize 
the expected error produced by this network over the 
net signaling rates (bf^ — d^), with the constraint that 
the mean effect of binding incorrect ligands on X* be 
zero, so that AX* oc c on average (App. C). Fig. ^ 
shows how the performance of such optimized networks 
approaches the theoretical bound as the number of states 
m increases. The convergence is significantly worse for 
a < 1/2 at small x. In that regime, the estimator of (12) 
is not valid, suggesting that this network design may not 
achieve the optimal bound even with an infinite number 
of states. The output of these networks can also be used 
to detect ligands. Their performance in doing so is com¬ 
pared to the optimal discrimination errors of Eq. (10) in 
Fig. ([^. 

The principle of maximum likelihood not only yields 
the fundamental bound on the accuracy of discerning 
cognate ligands from spurious ones, but also suggests bio¬ 
chemical solutions to approach this optimal bound. Such 
maximum-likelihood inspired designs have been previ- 



FIG. 5: Network performance. Error made by optimized 
networks with a finite number of receptor states m (green to 
blue curves), compared to the theoretical bound (red curve), 
for (a) X = 10~^ and (b) x = lO”"^. 


ously proposed in the case of a single ligand [13[IS]. The 
network structure proposed in this study (Fig.|^ is rem¬ 
iniscent of kinetic proofreading schemes and their gen¬ 
eralizations, which provide a well-known solution to the 
ligand discrimination problem |26l |28l |29l [32l [35] . An im¬ 
portant difference is that here signaling occurs during all 
steps, albeit at various, fine-tuned rates, and with poten¬ 
tially negative contributions, the role of which is to buffer 
the effect of wrong ligands. Consistent with this predic¬ 
tion, it was shown that a negative interaction through 
a diffusible molecule between kinetic-proofreading recep¬ 
tors could mitigate the effects of large numbers of incor¬ 
rect ligands in a discrimination task |26] . 

The present results are relevant beyond the particu¬ 
lar case of sensing by receptors, and apply to any kind of 
biochemical signaling in presence of competing ligands or 
“cross-talk.” This is the case for example in the context 
of gene regulation, where competing transcription fac¬ 
tors may bind regulatory sites unspecifically, a problem 
particularly acute in metazoans |3f)j . 

The scaling transition occurring at the binding con¬ 
stant ratio a = 1/2 suggests that different strategies 
should be employed depending on how hard the discrim¬ 
ination task is. In particular, the approximate but bi¬ 
ologically implementable estimator of Eq. (12) curiously 
breaks down in the easy discrimation regime, a < 1/2. In 
that regime, the optimal bound is harder to achieve be¬ 
cause it is dominated by rare, long binding events that are 
hard to encode by biochemical solutions. The example of 
immune recognition falls precisely into that regime, with 
a binding constant ratio a between agonist and nonago¬ 
nist ligands ranging from one fifth to one third |27|. More 
elaborate network designs, probably with feedback, may 
be needed to achieve the theoretical bound Eq. ([^ in 
that case. Finally, this study has assumed throughout 
that the unbinding rates r and r' are priorly known to 
the system. Complex mixtures of ligands with unknown 
binding constants would make for interesting generaliza¬ 
tions. 

I thank A. Walczak for her helpful comments on the 
manuscript. While this article was under review, a paper 
treating a similar topic was submitted to the arXiv m- 
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Appendix A: Cramer-Rao bound 


allowing for factorization over datapoints: 


1. The Cramer-Rao bound is tight: a physicist’s 
proof 


In general the Cramer-Rao bound is a lower bound on 
the error made by any unbiased estimator, but it is not 
always certain whether this bound can be achieved. Here 
the maximum likelihood estimate is shown to approach 
the Cramer-Rao bound in the limit of large samples. 

Assume that the likelihood of the data factorizes over 
independent datapoints, 


£ = InP = f(a;, ti), (Al) 

i=l 

where x is the model parameter to be estimated, and 
(ti,..., tn) the series of datapoints. In the specific case of 
receptors binding to two types of ligands, x is the fraction 
of correct ligands, £ the duration of binding event z, and 

^{x, ti) = Inr' — r'ti -|- In ^1 — x -|- . (A2) 

The derivative of £ with respect to x is denoted by 
£'{x,ti) = d£{x,ti)/dx. The maximum likelihood esti¬ 
mate X* satisfies: 


^£(x*,£)=0. (A3) 

This estimator is unbiased: if x denotes the true param¬ 
eter with which the data was generated, then x* should 
give back x on average. Equivalently, 



f-l-OO 


= n 




9x Jq 


= 0 , 


(A4) 


(the last integral is just I because of normalization), 
where (•)£ denote averages over data generated with the 
true parameter x. In other words, the maximum of C 
is reached at x on average. The probability that this 
maximum x* be larger than a certain value x > x is: 


P(x* > x) = P > 0^ 



The Heaviside function 0 can be replaced by its Fourier 
representation: 


0 (x) 


7_oo 27ra; 



dA 

27rzA 




(A6) 


(x > X = 


/ +ioo ” /--l-oo 


' —200 
^+200 


dA 

2m\ 


exp 


r-l-oo 


Tn 


(A7) 


and 


P(x) = - 

__ 

27rz 


dF{x* > x) 
dx 

•+00 


X exp 


dt£''{x, t)e 

r+oo 




(AS) 


^-hoo 

(n - 1) In / 

Jo 


This integral can be evaluated by a saddle-point ap¬ 
proximation in the large n limit: 


J <iAG(A)e“«-') » (A9) 


with 


G(A) = 


£{x,t)-\-X£' {x,t) 


/p dt£''{x,t)e 

(;ie^(£,t)+AC'(a;.Z) 

/•-l-oo 

F(A)=ln/ ^^pi(i,t)+\i'(x,t)_ 
Jo 


(AlO) 

(All) 


The saddle A* is given by the condition that the deriva¬ 
tive of the argument of the exponential with respect to 
A be zero: 

l•+ao 

/ =0. (A12) 

Jo 

At X = X, this condition is satisfied for A = 0. In the 
limit of large samples, x* — x is small and so should the 
corresponding A. One can expand at small x —x > 0 and 
A: 


£'{x, t) « f'(x, t) -I- (x — x)£''{x, t), (A13) 


and 


/•-l-oo 

/ dt [(x — x)f"(x, t) -I- X£'{x, t)^] = 0, (AI4) 

Jo 


yielding A = x — x and: 

VnH 

P(x) « —exp 




-(x — x)^H 

2 ^ . 


(A15) 


with 


H = - 


/•-l-oo f+oo 

/ dtf"(x,t)e^(^’*) = / dt£'{x,tfe^^^'^l 

Jo Jo 

(A16) 
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A symmetric argument gives the same result for x < x. 
The resulting distribution of x* is Gaussian, with mean 
X and variance 


(fe 2 ) = 


1 

nH 


/ d'^C{x) 
\ dx"^ 


-1 


/ dC{x) 
V dx 


In the specific case of Eq. (|A2|), 
H = 


r+°° , (fyp{i-a)r't _ l'i2 

0 1 + a;(Q;e(i-“)’'* - 1) 


(A17) 

(A18) 


Performing the change of variable u = r't yields the result 
of the main text: 


{5x^) 


f{x,a) 


n 

+O0 


/.-poo / 

with f{x,a)~^ = / due~^- 

Jo 1 


ae 


(!-“)“ _ 1)2 


— X + xae 


(1 —a)ii 


(A19) 


2. Small X limit 


For a > 1/2 and small x, the integral in Eq. (A19) can 
be approximated by: 


p + OO 

f{x,a)~^ ~ — 1 )^ = 

JO 


2 _ ( 1 -a)' 


2 a- 1 
(A20) 

For a < 1/2, the function e““(ae^^““^“ — 1)^ is not 
integrable, and the denominator of Eq. ( A19| ) is neces¬ 
sary to ensure integrability at large t, however small x is. 
Thus, the values of u governing the behavior of the inte¬ 
gral satisfy xae^^~°‘^'^ = 0(1). This observation suggests 
the change of variable y = a:ae^^““^“: 

f{x,a)~^ = x~^aT^[ dyy~^ 


1 — Q 


l-\-y — X 
(A21) 

with (3 = 1 — a/(l — a). Expanding {y — gives 
three terms scaling as x~^y~°‘/^^~°‘\ and 


x2-Py-Y^ at small y, respectively. The last two give 
diverging integrals as x —>■ 0 for all a, yielding terms of 
order 1. Only when a > 1/2 does the first term give a 
diverging integral, and thus a term of order 1 in x; in 
that case, the sum of all three terms gives back the re¬ 
sult of Eq. (A20). If a < 1/2 however, the first term is 


integrable and thus dominates the expression for x <C 1, 
yielding: 


/(x, a) ^ = X ^a’ 


1 


^ + O 0 


dy 


y 


0 ( 1 ), 


1 - a Jo 1 + y 

(A22) 

where 0(1) denotes a term of order 1 at small x. The 
integral can be calculated: 


^+oo 


dy 


1 + y sin^TTG^ 


to finally obtain: 
f{x,a)~^ « 


1 — a 


sin 


(A23) 


(A24) 


In the intermediate case a = 1/2, the three terms in 
the integral of Eq. (A21) are of order y~^, xy~^ and 
x^y~^. Again the last two terms diverge in the integral 
and give contributions of order 1. The first term also 
diverges, but its contribution reads: 

aT^-^(-ln(ax)) = ln(x/2)|, (A25) 

1 — a 2 

so that: 


fix,a) ^ =-|ln(x/2)|-H0(1). (A26) 

When X = 0 and a < 1/2, /(x, a) = oo, as the large 
deviation function of x* becomes nonanalytic. The ex¬ 
pansion of £' in Eq. (A13l is no longer integrable when 


done aro und x = 0, and needs revisiting. The integral in 
Eq. (|A7|) reads: 


^+oo 


due “exp 


A(ae 


(1 —a)u _ 


1 ) 


1 -|- x(ae(i““l" — 1) 


/•+00 


= 1 


due 


exp 


A(ae 


(1 —a)w _ 


1 ) 


1 -|- x(aeG““l“ — 1) 


- 1 -A(ae(i-“)“-l)| 


where the same change of variable u = r't has been done. Doing a further change of variable to y = axe^^ yields: 

■(A/x)(j/ - x) 


(ax) 

1 — a 



r r 

/ y 

fax 

{ exp 


1 + y — x 


-I - i\/x)iy- x) 


(A27) 

delds: 

(A28) 


The term is the brackets is of order {y — xf, as was the case in Eq. (A19). Hence, terms in y^ are integrable and 
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dominate the expression, which becomes at leading order 


,A: 

(ax)T^ /■+'- f 

r (.X/x)y'\ 

1 - a io ^ 

1-by , 


1 + - / ?/ exp _ 1 _ {\/x)y 

(A29) 

With A = \x, the saddle-point condition becomes: 

ip'{\a) = Q, (A30) 

with 

ijj{X,a) = - 


1 — a 


y i-“ — 1 — \y^ (A31) 


and the cumulative probability distribution is: 
1 


2-Knx q;)A 


> x) «— . — exp —nx^-°‘%l^{X,a) 

(A32) 

where ip' = dip/dX and ip” = d^ip/dX^. Fluctuation of 
X* are thus of order n°‘~^ Ij^/n. 

When a = 1/2, the term of order {y — xY in the brack¬ 
ets of Eq. (A28) dominates and diverges, so that this 


expression reduces at leading order to: 
l+y|ln(x/2)| 


(A33) 


The saddle point condition gives A = 1 and one obtains: 


P(x* > x) 


1 


: exp 


nx‘^ \ ln(a:/2)| 


a/ttux^I ln(a;/2)| 

rA34) 

which implies fluctuations of order 5x ~ (nlnn)“^/^. 


Appendix B: Probability of discrimination error 
1. General case 


The integral representation of the Heaviside function, 
Eq. (A 61 can be used again to obtain: 




FP = 

FN = 


—too 

/>+ioo 


dX 

27riX 

dX 

2TTiX 




xe 


y^^^{l-XWx,t)+Xi(0,t) 


(B3) 


Substituting A —)■ (1 — A) in the second equation gives an 
expression for FN that looks very similar to FP: 


r dx 
J 2TTi{l - A)' 

In summary: 


,(i-A)e 


dtY^-^)Ho,t)+x.i(x,t) 


(B4) 


^+200 


FP = 

FN = 


dX 


„n0(A)-AS 


— too 
/‘+200 


27riA 

dX 


(B5) 


27rz(l — A) 


„n^{X) + {l-X)e 


with: 


0(A) =\nj 


= In / due 


1-b - 1) 


(B6) 


These two expressions can be evaluated in the large n 
limit using a saddle-point approximation, with the same 
saddle-point condition 9 = np'{X) for both, yielding: 


FP: 

FN : 


1 


A-\/27rn|0"(A)| 

1 


(1 - A)A/27rn|0"(A)| 


exp [n0(A) — A6*], 

exp [n0(A) -b (1 — A)0]. 


(B7) 


The discrimination between two competing 
hypotheses—presence versus absence of the correct 
ligand in fraction x —can be performed by a likelihood 
ratio test: 

P(tt’-Pi!\^=0) = |: <.) - «(0. ()1 > e, 

(Bl) 

where 9 is an adjustable parameter. The false-positive 
and false-negative error rates are defined as the proba¬ 
bilities of detecting the presence of a ligand that is in fact 
absent, and of missing it where it is there: 

/ n / " \ 

^ [£{x, U) - m i)] - 0 , 

2-1 \ 2 -l / 

P n / n \ 

E7V = y ( ^[f(O,t,)-4x,t)]-b0j . 

(B2) 


2-1 


^ 2-1 


2. Small X limit 

Again two regimes emerge in the x ^ 1 limit, depend¬ 
ing on whether a is smaller or greater than 1/2. When 
a > 1/2, 0(A) can be expand at small x: 


'/(A)«-^^^^^A(1-A)x1 (B8 ) 


This implies: 


FP . 


FN 


with 


g-iA^n£c^/ 3 (a) 

y/2TrX‘^nx^jg{a) ’ 
•\/27r(l — XYnx'^/g{a) ’ 


(B9) 


= (BIO) 











































When a < 1/2, one can do the same change of variable 
as before, y = to obtain at leading order: 

</(A) = —- / dyy~T^ [(l + y)^-l-Ay] . 

1 - a do 

(Bll) 

The integrand is of order y “/at small y, and there¬ 
fore is integrable. The error rates are then given by: 


FP : 


FN : 


exp 


nxi-o (x(A) - Ax'(A)) 


exp 


AY27rnxi-“ |x"(A)| 
nxT^ (x(A) + {1- A)x'(A)) 


(B12) 


(1 - X)\l2TTnx^-°‘ |x"(A)| 


where 

/*+c>o 

X(A) = :j- / dy y~^ [{I + y)^ - I - \y] . 

1 - a do 

(B13) 

The intermediate case a = 1/2 is treated similarly as 
before, by noting that the integral defining <j){X) is dom¬ 
inated by the (diverging) term of order y~^. This gives: 


^(A)«-^|ln(x/2)|A(l-A). 
and therefore: 

ln(a:/2)| 

y^TrA^rnc^lln/xT^' 

ln(a;/2)| 

FN « . 

■\/7r(l — A)2na;2| ln(a;/2)| 


(B14) 


(B15) 


As a result, the number of binding events n necessary 
to a make reliable decision scales as x~‘^ for a > 1/2, 
a;“^| ln(x)|“^ for a = 1/2 and x~~* for a < 1/2, with 
7 = (1 — a)~^. Replacing n « 4 Zlactot(l — p)NT gives 
the scaling for the minimal detection time: 


T 


1 

ADa{l-p)N 


ctotc”^ a > 1/2, 

CtotC“^| ln(c/ctot)r^ « = 1/2, 

Ctot^c-^ a < 1/2. 

(B16) 


Appendix C: Optimization of the signaling rates in 
the receptor cascade 

Each receptor goes through a cascade of states p = 
1 ,..., m upon binding. At any moment, the receptor can 
become unbound with rate r or r'. In the following some 
expressions will be given in terms of the unbinding rate 
of the correct ligand r, but the same expressions hold for 
the incorrect ligand after substitution by r'. 

The probability of reaching state p is [s/(s -I- r)Y~^. 
Assuming it has reached state p, the time spent in 


that state is distributed according to (s-l-r)e In 

summary is distributed as follows: 


Pritt^) = 




(s -I- r)^-2 


^-{s+rpf, _|_ 


1 - 


M- 1 ' 


Sit,,), 


(Cl) 

where 6{x) is Dirac’s delta function. Its first and second 
moments are: 


{t,i)r 

K)r 


gM -1 

(s-l-r)'^’ 

gP.-l |-^ gM-l 

(s-g^)M+i [ + 


The output of the network is given by: 


dX* 

dt 


N 

A^o ~ 

3 = 1 


(C2) 

(C3) 


(C4) 


where pij) is the state of the receptor and bo —do = 0, 
so that the net effect of one binding event is 

m 

NX* =XoY,ib„-d„)t„. (C5) 

/i^l 


On average, binding a wrong ligand will cause a change 


(AX*),, 


Ao y^jbf^ 

,1=1 


d,i) 


g/x-l 

(s-kr')^' 


(C6) 


When optimizing over the net rates (5^ — d^), this quan¬ 
tity is set to zero, to ensure that only the correct ligand 
changes X* on average. This way, X* is proportional to 
c in the limit of long times: 

(X*(T)) « 4Da(l - p)NTc{AX*)r, (C7) 

Although the mean of X*{T) is not affected by incorrect 
binding events, its variance is, and reads: 

(X*(T)2) - (X*(T))2 « 4Da(l - p)XTctot 
X [x((AX*)2), + (1 - x)((AX*)2),, - x2(AX*)/] , 

(C8) 


where 

m 

((AX*)2), = (AX*)2+x2^(&^-d^)2(5t2)^. (C9) 

,1=1 


and the same for r'. 

For a given m, the signal-to-noise ratio 


SNR = 


(x*(r ))2 


(x*(r) 2 ) _ (a*(t ))2 


(CIO) 


is maximized over the rates h^ — d^. The procedure gives 
the optimized networks discussed in the main text. 
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